A Study of UCT and Its Enhancements in an Artificial Game
نویسندگان
چکیده
Monte-Carlo tree search, especially the UCT algorithm and its enhancements, have become extremely popular. Because of the importance of this family of algorithms, a deeper understanding of when and how the different enhancements work is desirable. To avoid the hard to analyze intricacies of tournamentlevel programs in complex games, this work focuses on a simple abstract game, which is designed to be ideal for history-based heuristics such as RAVE. Experiments show the influence of game complexity and of enhancements on the performance of Monte-Carlo Tree Search.
منابع مشابه
UCT Enhancements in Chinese Checkers Using an Endgame Database
The UCT algorithm has gained popularity for use in AI for games, especially in board games. This paper assess the performance of UCT-based AIs and the effectiveness of augmenting them with a lookup table containing evaluations of games states in the game of Chinese Checkers. Our lookup table is only guaranteed to be correct during the endgame, but serves as an accurate heuristic throughout the ...
متن کاملAn Adaptive Learning Game for Autistic Children using Reinforcement Learning and Fuzzy Logic
This paper, presents an adapted serious game for rating social ability in children with autism spectrum disorder (ASD). The required measurements are obtained by challenges of the proposed serious game. The proposed serious game uses reinforcement learning concepts for being adaptive. It is based on fuzzy logic to evaluate the social ability level of the children with ASD. The game adapts itsel...
متن کاملKnowledge Generation for Improving Simulations in UCT for General Game Playing
General Game Playing (GGP) aims at developing game playing agents that are able to play a variety of games and, in the absence of pre-programmed game specific knowledge, become proficient players. Most GGP players have used standard tree-search techniques enhanced by automatic heuristic learning. The UCT algorithm, a simulation-based tree search, is a new approach and has been used successfully...
متن کاملPlaying Tetris Using Bandit-Based Monte-Carlo Planning
Tetris is a stochastic, open-ended board game. Existing artificial Tetris players often use different evaluation functions and plan for only one or two pieces in advance. In this paper, we developed an artificial player for Tetris using the bandit-based Monte-Carlo planning method (UCT). In Tetris, game states are often revisited. However, UCT does not keep the information of the game states ex...
متن کاملTomographic Determination of Temperature Distribution in Billets (RESEARCH NOTES)
The principles of Ultrasonic Computed Tomography (UCT) are reviewed in this paper. The UCT is a powerful nondestructive technique in medicine and recently in industry to reveal an image of a slice of an object or body. The advantage of UCT over other conventional techniques in imaging is that a computed tomogram yields quantitative information about the section of interest. Experimental works o...
متن کامل